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(54) Title: UPA, A UNIVERSAL PROTEIN ARRAY SYSTEM 



(57) Abstract 

This invention relates to ordered arrays of molecules, for instance polypeptides or proteins. Such arrays, referred to as universal 
protein arrays, are described in both macro- and microarray formats. Methods of production and use of such arrays are also described. 
Also disclosed are kits for the use of such arrays and the preparation of probes for use with them. 
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UPA, A UNIVERSAL PROTEIN ARRAY SYSTEM 
FIELD 

BACKGROUND 

•-cdc « slllls fr0 „ abnormal """" ^ """"'^ ^ " - - — 

polymerase II. However an additin , ■ " 'FIID, TFIIE, TFIIF, TFIIH and RNA 

We, 386, 569-577 1997) Bn,h h , „ 2657 -2683, 1996; Ptashne and Gann, 

T hus , in , 0 w e „ mn ,„ ni , r r 8b pro,cin - DNA '«*-• 

-HT * one orl~ " ^ *~ <°"> 

expre™ (SAGE, (Ve,™^ ^ s ! "'"^ "»* 

en ,pl 0yca M audy ^ rt|aljonshii) tetwetn expression and cand 
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and have made significant contributions to our understanding „r , 

However. Wiedge of which „ g *J££Z 7T 

expression patterns is s,i„ Siting due to the ,ack of J " ^ ^ 

e-ineintermoie^ 

Therefore, there still exists a strong need for reliable , !m . 



SUMMARY 



protein-RMAandprotein-Jiaandint. , P™e.n-prot ei n, protein-DNA, 

>nvention may be purified such that fhP ^ ■ , yp pllfle for «n the arrays of this 

suc/i that the desired protein represents at least «,n°/ „r A 
content of the preparation In nthpr . k the t0tai P rotein 

protein content of the preparation r m ° re ° f the tota ' 
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from the following groupofpolypep , des:TFMA F 

«. tr. oci. S p, G4 . H G , 147i G4 . AH> G4 . VPI6 G ; C ^; D G ™ E ; T ; f ™- ™ 

Tat, PC4-P, PC4-N PC4-C PP4 ^ or, , P '' ^' A > G4-IE, G4- 

» W. Topo , AS , SR , ; T . N ^GS^ ™I " ' <W "' T0P ° ' *" ' 

Th«p res .„„„v cml „„„ S( , pn>vides , ss , ysemp , oyjBgiteseiirray8 

^<=r:irr::;::r' io " a r ray - bistdpra,ei " ~» 

Probes for use with assays of this invention can be any molecu.es that ■ „ „• 
polypeptide Examni« „f ,. • molecules that might bind to a 

yp Pt de. Examples of probes include single-stranded nucleic acids (DNA or RVA. h m 
stranded nucleic acids (DNA nrBWA> • A) ' doubie - 

' or KNA), proteins, and ligands (eg druo, , nv ; n c 
co-factors, substrates or reaction nr.H , , 5 dru § s > tox, ns, venoms, hormones, 

contacted to an arrav of tar™ n«i • . 8 * The probe 1S 

»~ h * probe lMf , " 1 ' *" ~ " - » *— I- 

1 nis invention also provides kits fnr i a K«r 

protein interaction assays (e , u „ v § " » * ^ 

ssays (,,., universaI protem aiTay based assays) such kjts jndude ^ ^ ^ ^ 
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capable of being linked to a probe Macule, and instructions for how to use the r ■ „ 

may also be provided in the kits. A probe molecule standard (either !h 

be included in the kit. unlabeled) may also 

5 Certain probe labeling kits will also include one or more arravs for • 

substan.ia.ly pur e polypeptide mo.ecu.es. " ' '"^ " ^ ° f 

Other kits provided in this invention are used for detain 

- ^ essable , J„ i0 „ „„ ,t Pr0,tm PlaCi " S * !a """ e — ^» * 

moteuta „ sed ,„ assa) , amys ,„ lhesc m any molecule f„ r „ , 

ac,, a polypi, . . fragme „, , hereof ^ mtaures J m • 

Arrays ased i„ L me I " 8 ^ " Wide ->*• 

==™~^"===- 

da* of m „l s , ^ ^ " d ^ d - *«» P<°°« - be from any 
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BRIEF DESCRIPTION OF THE FIGURES 
Figure 1: Protein-Protein Interactions 

The universal protein array (UPA) provides quantitative detection of soecifi, , • 
interactions at different salt wash stringencies Fiu 1A h u P'Ote.n-protein 

5 from the herein-described UPA th k aut0 ^aphic signals detected 

aescnoed UPA that was incubated with "P-labeled GST-K n« ,k 
low salt buffer A 1 00 (1 00 mMKCDm „ u GST-K-p52, then washed with 

«uwmMKCI)to remove unbound probe, as descrihprf ;„ c 
shows the autoradiographic sisals detect r u ^ 3 " Fi * 18 

AlOOO^ooo!.^ ; iena,S det£Cted fr0m ^ e — = ^ after it was w ashed in high saIt 

™e I is the polypeptide target arrangement key for the array shown in Fig. IA and IB 
Fig- IC ,s a p.ctor.al representation of the relative affinities of the 48 arr, , 

the transcriptional cofactor P 52 after ,h, , yed Pr ° teins for 

«or P 32afterthearraywaswashedwithbufferA1000n00mVi^^ ^ 

units are reading units from a densitometer. KG) - Ut 

Figure 2: Protein-DNA, Protein-RNA, and Protein-Ligand interactions 

- 2 A), Pr0 ti: D i~ 

- L UPa!^^" ^.^ " CP* 2P) interaction, 

^-~^A^rzrc^ rf, ^ ,h,l,pA,, "■ 

20 inFi gU rer° rFi8 ' 1,TabIelCOn " nSthC ~^ 

Figure 3: Detection of ASF/SF2-ln, era cting Proteins Using a UPA 
6H(K)ASFr ft P r einS/Pr0te,n fr3Cti0nS W6re 2nalyZed Wh 32P-.abe.ed 

; n„ ■^^F'g-SCareautoradiographsofthebindingpattemsontheUPA.ft u- 
b -'thlOOmMKClorSOOmlKCLrespectively. m s on the UPA after washing 

Key to the abbreviations in Fie 3A- cm th» r , • , , 
P.l« „ ,„ 0ST . TBp ; ™ m """^ » "i"*"" «*»ni 8 of RNA 

> receptor; His-Hl histone Hi ■ u- t . re ceptor, TR, thyroid hormone 

1, histone Hi , Co-His, co-histones; HMG I , high mobility group protein i-asf 
aitemat,ve spl.cing factor; GST-Nu, GST-nuc.eo.in fusion- GST K GST f „ 
muscle kinase site ' ° ST fUSed W,th a s y nthe ^ heart 
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DETAILED DESCRIPTION 
I- Abbreviations and Definitions 

A - Abbreviations 
ASF: alternative splicing factor 
Co-His: co-histones 

CTD: the C-terminal domain of RNA polymerase II fused to GST 
CTFIID: flag-tagged TBP-containing TFIID complex from HeLa cells 
C4-94 1 G4.147,G4-AH,G4-VP 1 6,G4-CTF,C4-Spl,G4-EU C4 IF 

^-transcription activation domainscseex^) GaUfUSedt ° 

GST: Glutathione S-transferase 

GST-K: GST fused with a synthetic heart muscle kinase site 

GST-Nu: CST-nucleolin fusion 

His-Hl: histoneHI 
15 HMGI: high mobility group protein I 
Oct 1: B-celJ specific activator 
p52; novel transcription factor p52 
p75: novel transcription factor p75 
P75-C: C-terminal region of novel transcription factor P 75 
20 p300-C: transcriptional activator 
PC4: positive cofactor 4 

PC4. PC 4 . N PC4 . C> PC4 . AS> pC4 . ml) pQ4 ^ ^ 

m7 ' PC4 - >W: var,ous p C4 polypeptides (see Table 2) 
PCAF: a P 300/CBP-associated factor that functions as a histone 
PCAF-C: C-terminal region of PCAF 
Pol II: polymerase II 
Spl: class II gene activator 

SR: serine-arginine protein fraction prepared from HeLa eel, nuclear extracts 
RPB5, RPB6, RPB8, RPBlOa and RPBlftft- ^ . 

30 II fused to GST ? * ,nd, ' VidUa, Wbunto ° f RNA ^merase 

RXR: retinoid-X receptor 
TAF250: transcriptional coactivator 
TBP: TATA-binding protein 



35 



TFIIA, TFNB, TBP, f: TFIID, TFIIE TFIIF r Tirnu « 

HE, IIF, and I,H ^ " ^"'^ f ^ "B, HD, 

Topo I (wt), Topo 1 (mt), Topo I (m)* T nnn 

Table 2) ' ? ^ ^ . po.ypeptides (see 



WO 00/54046 



PCT/US00/06244 

-7- 



TR: thyroid hormone receptor 
UPA; universal protein array 



B- Definition*: 

S);»nd Robe* A. «o/ C c„ to «„/ 1 ' ,M4 < ,SBN M»-02.«2. 

is miniaturized so as to reauire min ■ rn.croarray", s an array that 

to requ lre m.croscop.c examination for evaluation 

"»n me at least two dimensions of the array surfW ru, • 
arrays the location of each mrt u, i . . surtace. Thus, m ordered 

l efuo™e r „f, fcearay£om , i „ slr , nMr . tionracioriiAibbrev J* Fn ^^- 

i» . p,i, of h o ri2MU „ y ; * eomro ' s - rae d " p,ie>,e samp,es - — 

Mrei " S ° f " ,ea, "^ f »f™t m «,,r, M c,ip I io„f.c,orl|A(TFIIA) 
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found a ,bo,l,,dd res ,„„ Jaddresslb , co|I TH SMirofaM , „. 



reference 



B,„d,„ g ori., eroe , lo „, A nass «i a , io „ bcw „ nnvoslJbstan 



, de,ec, ab ,. ™„, „r,h, probe associaKS wi , h , ^ 

-bind- . P°lyp«P«de s ,o different ex,e„, s , and the ,™ 

P«rs,s, afte, , he araly 1S WBlled in . higbe, b„ff„ 500 mM „ , 0M mM » 
The .en, -bindins ctarisfa of „ „, y for . 

»o U nd> pro be, s w asn e da „ ay . Tni sp .„e™ (wbieb „ ay » „ 0 posili>e ^ ^ 
*e .e, of.be my ( „ hich lists ^ „ ^ £ » 



an„ : « S o f ,bep,ober„ th o stp o, yP e plid e motales(ass „ m >uhe 
binding sues are immobilized at each addreK nn n» . v « 



»nn,„. ,be sig „ als imo , „„,„„,„ f „ rf ^ ^ ^ ^ J 

DN A( de 0 , yri bo„,,e,,, cac , d): DNAisalongcta . n lhauonBins , h (nei 



one of 



£ta ta -'^^ 8 -i".c > ,o S ine 1 „d,b >mi „ t) bo„„ dlosdeoxyrteesug „ Ki>hkht 
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m . P«lypep,id e , „ f. r . SMp si& , al . m m .. CM , . ^ ^ 

High 1 hroughp u , ge „„ mte: Applied ofgeo( , mic „ . 

Isolated: A„" iS ola«d- biologic,, c„ ra po„ tm(suchlsa „ llclcicacidm . 

nanism , n which the component naturally occurs / e other rhm m 
»l„e ,„c, ud e ac , ds „„ pro , c . ns ^ by ^ 

rs^trr*^^— ------ 

,„„ „ t ' """P"" 5 """a an.losi« of na.ural „„ cfco , i<le! 

to , W „ dra ,o mcldcatidsi „ aman „ erstai|ariona|ora||yoccm|ngnneieKite 

O„gon,,.le.,,d., A^POl.«» U clo„i* s! ,„ ence „ fupI! , atK>ul300ni , c| 

* ^ . P01ymcko , idc (such „ dna or rna) wwch b k : ; ■ 

> examplca.leas, ,5,50, 1 00 or even 200 nueleo.id.s long 

have ao„ A „r„r" C " 0,ide " "** *" ^ ^ » bo, 

« y occorr.ng p .„,o„ s , such as a „ e „ d ^ ^ _ m 

i 0^.pep,ide: An oiigop.p.ide ,s d e„ ocd as a ,i„ear m „,ec„,e of aboul 5 „ few „ ami „ 



acid residues. 



Peptide Nuc.eic Acid (P NA): An oligonucIeotjde a „ ^ 
monomers cou P ,ed by amide (oeDiirlel k„„ ... u . comprise of 

bonds. '*<P<P''«=> bonds, s»ch as am,,o acid monomers joined bypep.ide 

Probe: A modern,, m ay bind ,o orimerac, w„„ one „, more poly P e p ,id, s Aprobt as 
■he «, ,s used herein, can be any mo,ec,= , tal ,s nsed ,o ch.„e„ g e Cprobe 

arrayed po.ypep.ides „„ h , hat probs „ 0 , t£ufc _ — * 

be fr.mdifferen, and varied mo,e, u ,ar crasses. Sacb ciasses are, for ,ns..„ ^ 
o-,es,ra„de dDN .„ RNA) , o|iso . orpolypep , ides(sueh 



WO 00/54046 



PCT/USOO/06244 



- 10- 



protein fragments including domains or sub-domains, and mutants or variants of natura.. • 

- U, for e,.„p,e, raicroscopicllly ^ « £ «* 

b= use,, for ins „„« mix , ures ' " ""'"""^ »»' 

(.*. ~» fluoropnor, ta cmi , „ mma , wavt , raeits) y " ■>"*'"■• 

Probe sla „ da , d: A probe molecule for use as a control in analyzin" an an"ay Po< r 

w„h a, „a s , „„ e ^ p,,,^ „ f ^ ^ 

s,a» d ard T " " """" * ** ' < " S ""' ""*» «""• » I— 

some , abtled probe s , Jndarts ^ provMtd ^ 

".olecuic l„ s o,u,io„, h oroe, ,o ^ w„„ he , , he prob e JL* w^T 

- - ™- • — p-n ^ in ,„„ „ „ : r 
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p°lypep.,d c , „ uc i tie acld molecll|( . „ |igaj)d) <««* « " 

Pr.dnflWn.pnte A oio.ogica! mdeck „<p resstd by , 
"* " More ^y. . p„ lypeplide is ' 

Arrays according .0 the proem invention include , plurality of ^, .„ 

• w . ^ taions o „ Itt surfice „ f * y ^ 

embodiments, polypeptides are deposite<1 M md , 0 J " 2 f «, I COTa ' n 

"»»™nm, a . or imermolecular interaction, -n, k ""»»""'«» «Pprox,m.tes 

in ™i. am ys „f ,„„ i„ vemion> refcrred K 
»«": array is occupied bv a „o„l.,r ■ , P«mcul a r address 

proteins can only be ^ f r h ™ f0 ™*" — on 

««-*~ In^ I ,beSe,0fd ' fferem(tat " i "^«)P» IW .d e s.c CU py i „ 8 
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embodiments, a substantially pure protein will n> PrePara, '° n - CertaiB 

j pure protein will represent at least 60% at least lew „ , 

least 85%, at least 90%, or at least 95% or mn ru ' 8 °° /o ' at 

tleast9 5/» or more of the total protein content of the preparation 
Recombinant: A recombinant nucleic acid is one ,h« h P re P^at,on. 
occurring or has a sequence that is ma d h " ** is " 0t n ^ 

-re commonly by heln T ^ * aCC ° mP "' Shed * Chem ' Ca ' «. 

engineering techniques. ' b * S enetlc 

5 ^ iitiit":rr oto 'T a " * s,r,pptd - - - * • — 

(NHASO, and I M u ra ), r„ r insIaoce „ „,„„, > 

m,n : ,es ' us "*'"' * e «*- «■*-«. * in . low slri seoc> zZ,l 

«*•«»» another probe rr,„l e „le Pr "" '° 

JL. it^, ^ e ^ ,,,, ^ , "' ,,,,,d - ta, "-'"'»-- 
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in varia«ions of the UPA technology, the detectable probe is a specific ™ • , 
■nteracion with the polypeptides of the UPA " 

5 

onmct. the present specification, including definitions, will control In ,h„y u 
Universal Protein Arrays 

unn„ T „ • of P rote '"-molecule mteractions. Although dot blot analysis with 

P r, ed protem preparations has been used for the detection of specific antibody!" 
tactions, use of high.y purified and active recombinant or native tare protein 

P^sas^ ^^^^^^^ 
m0 ,ecu,e, 8 nat ' Ve interaCti ° nS b£,Ween P °'^ - P-be 

Known technics fall far short of the UPA invention disclosed herein ,„ the case of fa 
western blot ana.ysis, protein fractions were usuaHy analyzed by SDS PACE , 
25 to a membrane, fol.owed by denaturation H ■ electro.ransferred 

ca oy aenaturation and renaturation before probino with a mdinhh i a 
probe. On average, only 1-10% of th„ a „- • , ■ ,. radiolabeled protein 

30 Hun ihelWKw,, blot Bay. ,eai ' '» >° 100-fold mo re ssns i,ive 

w ,„« ,„„„„, of aaive pr0 , ein ^ for . ntmci . on 

hl n „- „ J ybnd SyStem> co -™no P recipitation, far-westem 

blotting and GST assays were all used onlv r„, „ • . . 

used only for protem-protcn .nteraction; the gel mobility shift. 



) 
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reprinting and eross-linking ^ ys „ ere ^ 

Given that the major part of the human genome sequence has been Hm , t a , 

89. , w, a „ d , hat mos , ac , iv[ pro|5i „ s ^ ^ Kd • £*. 32- 

^.ovirus or _ „,„ „. avai , ibi|iry ~" ^ 

0 282, 6!2 .„,. , W) „ m provMe a neh * t^ " ' ''7 " 
inaction studies. The UPA system „„, , ■„ UPA.n,ed,a,ed polypeptide 

drugs. with , r e m e„d.„s potent,,, ^disease diagnosis. *" 

1 The hT" deSCri " ed " ,m ' ^""^ ° f <™™ >™» of*, invention 
The emb.dunents ,„d examples given are mean, in no way to limit the invention 

— Choice of polypeptide targets 

The ,arge,(s, of i„« res , wlll ta „ m f ^ 

-rnple certain targets of interest ate we,, k „own and i.Coded in public databases , a Lean, 
or a s.tndar comm.rcia, database. Other targets wil, he identified from jouma, .rtides or ^ 

other techntoues. ,„ «„ aln Mima ^ ^ » • 

Prodded vi, ,„ ASC „ tex, „,e. fc , slan „ „ >ssis , ^ ^ ^ ~ 
Any polypeptides can setve as targets for use in ,h, subject arrays For instance .„ , 

» tlned ,n vartous ways, Ending p^ins ,ha,ae, ,„ a specine ce„u,ar process (,g ,„„ "I „ 
y pr.tems,, proteins ,„ are „ . ^ prae „ / , J^"' 

rotatory pathway,, proteins «„„„„ M be involved in diseases, etc. Arravscan also he pic d 
•ha, mc.ude prote.ns of a specine rype ( , g . DNA polymerases, from various different so c^s 
Arrays . the oiigopeptides „ p0 ,. ptp ,,„ ^ fo , by ^ ~ 

for the nanctio, of M ,id„al EST-linxed genes and the proteins they encode 

In essence, any cotnhination or grouping of polypeptides can be assembied together one or a 

-ofUPAsforsirnultaneousanalysisofintenactionwithoneormoreprob.ofint.J 
By way of example, th.te are approximately loo.oOOdifreren, genes in the h„ m , 

ery .ene the human genome, every pr o„i, encoded for by each human gen, can be arrayed ,„ 
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h— g eno m e. ^spe^fo^H.^.^.^^ „ „^tl b T 

10 £«A«r«:Aioco/i, etc.). and so forth. 

As demonstrated below (Examole "n I ipa , «■ • 

«»■>»» and DNA- o, RNA bind 7 " d " 

DNA o,RNA.b,„d,„ g d<,m„„ ! „f, protei ^ In c<r,ai„UPA S „fd,i s !„«„,„„ 

es ed p., yp e p „de delelions orva ^,, g |!ns|h or s serte ^ « 
» -»--» 8 '" i -*-Sh. U ,. h e s e,„ n «. Anolh „ alleraa , |vcis , coI|eaionof » n , ,C,d 

»no,ne, („o„.,ar ge0 prottin . Such (lomain ^ * * ^ .0 

- » „ impona „, ,„ k _ or s w bind ■ ~ ~r 

.ar g e, p,o,e,n and , ht prooe „ probES used |q ^ ^ ^ »"* 

2» APPliC *' i »" r ' l '=™ i '««'P".«-„ ay , e c 1 ,„o,„ gyare „ ot , imite d,„ sni d its „f 

— * ,„a, ysIi of sach faclors . upA am|ysis cou|d b; ^ » 

c„di„o„, „ we „ as ,„ nomal „ anjma , praein - 
25 from different transgenic animak « cultured cells, .potties 
Polypeptide arrajs accrdir, g ,„ i nvenlion may als0 ^ ^ 
on g e.es and „, g e,s discovered (rem, for exampls , h ,, h ., hro „ g „ pu , 

DNA nticroarrays, „, SAGE (Serial Analysis pf Oene Ellpression) ^cu „, , 

- — - „ different cance, types, or in ^ ^ « ^ 

of Hundreds offers derived from patients „„„ k „ ov(n d^e outcomes ^ 
b— .r ,ssoc,.„o„ assu y5 10 b ^ „ ^ ^ ^ ^ 
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markers, or markers predicting therapy outcome, which are associated with po.ypeptide binding 

characteristics. 

Polypeptide arrays according to this invention may also be used to help assess the ability of 
certa.n dn. g s or potentia. drugs to interact with target polypeptides, or the ability of such molecules 
5 to block the interaction of other probes with arrayed polypeptides. 

The UPAs of this invention can be used to investigate receptor specificity of different types 
of known and suspected receptor modules. Examp.es of receptors that can be investigated for 
probe-specific binding by arrays according to this invention include but are not limited to 
-.organism receptors (for instance, those found in fcngi, prot020a , and bacter , 
bacter.a. strains that are resistant to antibiotics); hormone receptors (including those involved in 
•abetes, growth regulation, vasoregu.ation, and so forth); and opiate receptors (involved in 
b.olog.cal responses, for instance to addictive drugs). 

Also envisioned are arrays that are custom produced for the researcher, with an arrayed 
collection of polypeptides tailored to a specific research project, research system etc 
> Not ,n any way intending to be limited to the list below, the following „ . , ist of the ^ of 

col.ect.ons of propeptides that can be arrayed on a UPA according to this invention: a.l or 
substantially .„ the pro teins encoded for by the genome of an organism; .„ or substantial^ all the 
prote.ns encoded for by a chromosome of an organism; proteins expressed in a cel. durin* a 
part,c ular growth phase or environmental condition; proteins expressed in a cell under a particular 
abnorma. state (such as cancer, disease, or infection); proteins expressed in cel.s at various times 
dunng the passion of a disease or condition (e g ., during progression of a tumor, 
of a chronic disease such as A.heizmers); proteins expressed in a particu.ar cel, type; proteins from a 
part.cu.ar protein fami.y { , g . , DNA polymerases, cel. surface proteins, transmembrane proteins or 
ragrnents [such as so.ub.e fragments] thereof, oncogene proteins, tumor suppressor proteins, and so 
forth); proteins that show sequence homo.ogy t0 each other; proteins that share secondary structural 
characteristics; proteins that associate to form mu.timeric complexes the suounits of a ribosome 
or a membrane ATPase); vira. epitopes; domains of proteins; proteins from different species- and 
co.lect.ons of fragments of any of these protein collections. 

- Production of su bstantially p ure ta rget polype ptide 

Polypeptides for use as targets on the subject arrays can be produced by any technique that 
y.e.ds native protein. These techniques in genera, include expression from engineered DNA 
constructs, extraction from native samples (e.g., c.inica, samples), or de novo synthesis of 
oligopeptide or polypeptide fragments. 

Expression of the target polypeptides can be carried out using vvel. known techniques For 
■nstance, partial or ful.-length cDNA sequences, which encode the protein of interest as a target on 
the UPA may be ligated into bacteria, expression vectors. Methods for expressing large amounts of 
protem from a Coned gene introduced into Escherichia co,i (£. co / 0 may be utilized for the 
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S«ch fus.on p „,eins miy „, m , de in larEe Y °' k - "*»• 

* ^ , ^ by P „ cing . arong , regutacd pra _ j:jz:z~ be 

S»,,ble methods a , eprts , n , rfinSamfe00kc(o ( ""'W 
0 oond ,„ ,„ s „ ub ,e ..die, Me.hods f „ „, rattiog proKins from , hese * « 

York, I9S9). V «'»^s»i.»b k fo,,h eE «p« ssionof ,„ 2fus|o ems ' . 

"'•"«^(^^ miler . m£MB0J 1M3)DEX ,. ' , mC ° de ' ht l' UR «»« 

,,~ .... •-"»!. l»^).pEXl- j (Stanley and Luzio, EMBOJ 

AWe 292.128, 1981). pKK.77-3 (Amann and Brosius, Cfc.40-.83 1985) and D PT r 
Moffa«,y. Moi Biol. 189:1,3, 1986). 1985) and pET-j (Studiar and 

— Choice of array f onT iat and «™ru,r» 

The UP A "I? Sie "" iCa '" ,, lheirS,rUn " re ' "-«- '-ionaii.v 

.hereof Such a myi c,„ ,„„„de. for «*a mp ,e. a, leas, 50. ,00. , 50, 200. 500, , 000 o, 5000 o, mo 
soa„ y ,e,n„ed ,o de.ee, ,he b on„ d ^ m lhe upA , , hougfc ^ > « 

e,„ b e e„ied „ i„ mos , _„„ _ fc 

need f„ ,„vesm,e„, „ s p eei„ized »d expensive reading ,,,ip„,e„, 

O.P. S.O, S.N., moMed s,l,e»n ntocellolose, P oly»i„ y l id =„. fluoride. polysryre „ e 
po.v,e„,n„„ r oe,h,,e„e. P „, ycartanlle , „ >lon , „,„ „, ; 

^f- ^ in„„ih,e „. supponsd membram) „ „ exlbk ^ 
* One t o mm e r ei, llyav „ to , emicromysysKmtacMteu!ed ^ ' ^ 

™ : fast - *- — . « c 

a patch of polymer on the surface of a glass slide. 

In general. , ma „„ „» arravstoM fe ^ ^ ^ ^ 

d,s,,„ su ,shed fa. si 8 „„s of„ei,„.. r ,„ g .a^, e„„e, * the „ k e d , y , „ by 

sc,„„,„ g o, ,e, di „, by , piece of e , uipraem „ wj , h ^ ^ ^ _ ^ 

Maero-UPAs are often lrray e d .„ memb „ n(s e||her ^ ^ ^ ^ ' 
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•f »y si*, bu, ^ lcal , y wi „ te ^ tha „ , square eem . meicr 

"b^srtd.jte, fiber, plastic ,„d metal. Macros ,,,«„,„,,„ . J"'*" 0 "" 5 "' 
arrangement 0. the mac,o, ray „ suc h , hal spoB CM ^ dtaj 

TH S corr e la,,o„, s „„ tneces „ 0 ,, OIhei|lvem . on ihongh 

lninicr«r n „UPA s ,,co r , monfa , llreis|teslIla 
*. order of, s,uare d centimeter „ te A r » «*. - 

it :i mm from "* Mh "- a Mo - foM *« * i 

40,000 s P o,s. Us„g m.crofabricion technoiogies, sue as pho,o,i,h„g„ p h y pion «„ d by 

lareelv d ^ r" 0f, "" y ' ,eP " <le '" 8 " *" " * PP "' ed 10 "* " - - V will be 

o, yP ep,,de applied a , each address „,„ By » 

a rnacroarra y can be applied in the a mount of a bouI I pmol or greater, for instance about3 pmol 

W *- ,s pmol, about , 0pmol , ^ , 5 pmo| _ in - 
• - ,du», spots on , mic,rra y wi „ usua||y be ^ ^ , 

•S pntol, .boo, 0.5 prnol. about 0, pmol, about 0. , pmol, , bou , , 05 pmo| „ ^ *■ 
In addmon, the surface area of sample application r „ r each "spot" will influence how . 
Pol» P =ptide is immobilized on the array surface Thus . I, I 

wilt. .. Thus, a larger snot (having a greater surface areal 

-general,, accept »"^i- 8 r,.e, m o„„,of,ar s e,mo.ecu„ m ,nasma„er s a m p„ spo , ' 

(having a smaller surface area). P 
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The target polypeptide itself (e,., the length of the polypeptide, its pr i mary and secondary 
~ re , lts binding charactcrjs , cs jn relatjon (o ^ array substratej fitc } influence h ^ ^ 

each target polypeptide is applied to an array. Optima, amounts of targe, molecule for appIicaIion t0 
an a ay he on ^ ^ ^ ^ ^ ^ ^ ^ 

targetpolypept^etoanarraysurfaceandprobingthea^ 

de.erm.ne of range of target molecule amounts that produce intertable results 

Another way t0 describe an array is its density - the number ofsamp.es in a certain 
.panned surface area. For macroarrays of the current invention, array density will usua.ly be 
^^^P*^*^^^^^ , 0en 

.he array substrate) to about 50 targets per scared centimeter (50 targets within a I cm by I c'm 
reg-on of lhe substrate). For microarrays, array density wil, usua.ly be one target per squared 
cenumeter or more, for instance about 50, about ,00, about 200, about 300, about 400, about 500 
about , 000, about , 500, about 2,500, about 5,000, about , 0,000, about 50,000, about 1 00 000 or ' 
I * more targets per squared centimeter. 

^ Application of targets to array* 

Targets on the ar ray may be made ofoligopeptides, po ly pe P ,ides, proteins, or fragments of 
these mo.ecu.es. O.igopeptides, containing between about 8 and about 50 linked amino acids can be 
synthes.zed read.iy by chemical methods. Photolithographic techniques a„ow the syndesis of 
-0 hun reds of thousands of different types of o.igopeptides ,o be separated into individu, spots on a 
s-ngle chip, in a process referred to as ,„ sUu synthesis, as has been done with oligonuc.eotide arravs 

Longer polypeptides or proteins, on the other hand, contain up to severa. thousand amino' 
acd res.dues, and are no, as easily synthesized through in vi ,ro chemica. methods .nstead 
polypeptides and proteins for use in UPAs are usua.ly expressed using one of severa. well known 
-5 ceHular expression systems, including those described above. Alternatively, proteins can be isolated 
from the.r native environment, for instance from tissue samples or environmental samples, or from 
express,on chambers in the case of engineered expressed polypeptides. After extraction and 
approbate purification, ,he po.ypeptide can be deposited onto the array using any of a variety of 
techniques. J 

0 In the methods disclosed in this app.ica.ions, target polypeptides can be delivered to the 

substrate of the array by various different mechanism, One is by flowing within a channel defined 
on predefined regions of the array substrate. Typica. "flow channel" app.ication methods for 
applymg the polypeptides to arrays of the presen, invention are represented by dot-b.ot or slo-b.ot 
systems (see, , g ., U.S. Patents No. 4,427,4,5 and 5,283,039). One a.temativc method for app,y in „ 

> the targets ,o the array substrate is "spotting" the target po.ypeptide on predefined regions (each " 
corresponding to an array address). ,n a spotting technique, the target mo.ecules are delivered by 
d.rect.y depositing (rather than flowing) relatively smalI quanti , es of ^ ^ ^ 
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each s,op T y p,c aI dispense* i«,u de m ink . ja pri „„ r „ , m dt| . ver » 

simultaneously. 5 

» .snuffers of u,e UPA s yS ,e m ,,,, low or bi „ buffe „ stfipping toirers) ^ 
no more , ta 25% , „ 0 m0 , e , han 20% , „ 0 _ , ha „ l5% no _ 
-.5 A r ». m .,e ta 3-/. „, u,e polypeptide on the array . , be end of ,„ c , ' 

•ashed olT using buffers of the UPA system. 

bin, ,b , DtPe "'"? °" ^ S,to S " bSm " e "»-ib, y 

b n he ,a,ge, ma , ou , faber lijkmg beiag ^ ^ ni ^ ||ute ^ pvDF ^ 

in othe, ,„«, , linkblg „ bi]d . g procejs must ^ p5rfom]cd ^ Mjure ^ )• 

po ypepti*,. Examp.es of Unking procasses a „ ^ ,„ ^ of ^ ^ ^ ^ ^ ^ 
s» s„ tts to « q „ ire sacb . , inking process , n ofder to b . nd ^ widt ^ 

P»'Wi*sop,io„a„ ymay be affi , ched , ofte „ aysllbsliaIetl]rou8h „ nkermo|ecu]es 

in cnain en.bodin.en,,, .he „o„. sa „, ple regions „f lhe arraj surfa „ ^ 
^»u r face,ha,«o».,c„„ tti „^, motoIcs)areblockedi „ orierioprcvemorinhib 
of the probe molecules directly to the array surface. 

It is beneflcia, in certain embodiments to app,y a known amount of each target polypeptide 
on he array. In particular embodiments, an essentially e qual amount of each ,ar g et po.ypeptLe is 
app , to each spot. Quantification and equivalent application of the targets permits comparison of 
Probe b lndl n g affmi* b^en the differem targets . ^ ^ ^ 

protems may be carried out through many techniques well known in the an. These include * 
quantitative itnm.ob.ot analysis, e n2yme activity assays (where appropriate), and commercial.y 
ava-lab, protetn quantification kits ( , g ., Bio-Rad protein assay systems), which determine the 
concentration of protein in a sample regardless of biologica. characteristics of the specific protein 

being measured. ^win 

Many other techniques could be used to measure the amount of a target protein present in a 
sample. For mstance, the amount of target protein in a sample could be measured using a 
quantitative en^e-.inked immunosorbent assay ^ as described by , 

ai (Placenta 18:155-61,1 997). 

>n certain arrays of the invention, referred to as poo.ed arrays, at .east one particular address 
on the array ls occupied by a poo.ed mixture of more than one substantia,,, pure target p 0l y peptide . 
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All of the addresses on the array may contains pools of polypeptide, or only some of the addresses 
depending on the use of the array. For instance, in some circumstances it may be desirable to array a 
•arget polypeptide associated with one or more non-target polypeptides, for instance a stabilizing 
polypeptide or linker molecule. In addition, the native conformation of certain binding sites on 
. proteins can only be assayed for probe binding when the target polypeptide is associated with other 
mo.ecu.es, for instance when the target polypeptide native.y exists as one subunit of a mu.timeric 
complex. Poo.ed arrays of the current invention include those on which one or more of the addresses 
contams a mu.timeric polypeptide complex. In the case of such an array, i, is envisioned that 
Afferent probe molecules may bind to different polypeptides within the complex of "targe." 
polypeptides. 

Although the identity of each probe in the poo.ed mixture at a specific address is known the 
mdw.dua. probes in the pool are no. "separately addressable." The binding signa. from a poo.ed ' 
address ,s the binding signa. of the set of different (bu, mixed or associated) polypeptides occupyino 
that address. In general, an address is considered to display binding of a probe molecule if at least " 
one polypeptide occupying the address binds to the probe molecule. 

Arraying pooled samples is also a powerful tool in high-throughput technologies for 
increasing the information tha, is yielded each time the array is assayed. Methods for analyzing 
s,gnals from arrays containing pooled samples have been described, for instance in U.S. Patent No. 
5,744,305, incorporated herein by reference in its entirety. 
E. Choice of probe molecule^ 

Any molecule that might bind to or interact with one or more polypeptides can be used as a 
probe w,th the disclosed arrays. In specific embodiments of the current invention, probes may be 
from different molecular classes (e.g., nucleic acids, oligo- or polypeptides, or various types of 
hgands). Probes (especially those .ha. are polymeric chains) may be of various lengths, and different 
results may be obtained from the same array by using related probe molecules of different length 
Likewise, varying the sequence of polymeric chain probes may provide valuable binding data" 

Though in many embodiments of the invention a single type of probe molecule (for instance 
one protein) a, a time will be used to assay the array, in some embodiments, mixtures of probes will 
be used simultaneously, for instance mixtures of two proteins or two nucleic acid molecules 
Simultaneous multiple-probing (e.g. double-probing) can be used to detect either competitive bindin* 
or bmdmg systems tha. require the interaction of more molecules than just one polypeptide target and 
one probe molecule. 

L Labeling and detection of p robe moleculefs) 

Usually, probe molecules used to assay the disclosed UPAs are detectable. Probes can be 
detectab.e based on their inherent characteristics (e.g., immunogenicity) or can be rendered 
de.ectab.e by being labe.ed with an Independently detectab.e tag. Such tags inc.ude fluorescent or 
lummescen, molecules that are attached to the probe, or radioactive monomers or molecules tha. can 
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be added during or after synthesis of the probe molecule. Other tags may be immunogenic sequences 
(such as epitope tags) or molecules of known binding pairs (such as members of the 
strept/avidin:biotin system). Other tags and detection systems are known to those of skill in the art, 
and can be used in the present invention. 

Labeling different probes with different tags to enable simultaneous detection of binding of 
two or more probes on the polypeptides of an array. Multiple-label challenges to an array of this 
invention can also be used to examine any competitive binding between the two arrays on different 
polypeptides of the array. For competitive binding assay S) however, only one of the probes needs to 
be detectable. 

G. Computer assisted (automated) detection and analysis of UPAs 
The data generated by assaying a universal protein array according to this invention can be 
analyzed using known computerized systems. For instance, the array can be read by a computerized 
"reader" or scanner and the quantification of the binding of probe to individual addresses on the array 
carried out using computer algorithms. Such analysis of the array can be referred to as "automated 
detection" in that the data is being gathered by an automated reader system. 

In the case of labels that emit detectable electromagnetic wave or particles, the emitted light 
(e.g., fluorescence or luminescence) or radioactivity can be detected by very sensitive cameras, 
confocal scanners, image analysis devices, radioactive film or a Phosphoimager, which capture the 
signals (such as a color image) from the array. A computer with image analysis software detects this 
image, and analyzes the intensity of the signal for each probe location in the array. Signals can be 
compared between spots on a single array, or between arrays (such as a single array that is 
sequentially probed with multiple different probe molecules). 

Computer algorithms can also be used for comparison between spots on a single array or on 
multiple arrays. In addition, the data from an array can be stored in a computer readable form. 

Certain examples of automated array readers (scanners) will be controlled by a computer 
and software programmed to direct the individual components of the reader (e.g., mechanical 
components such as motors, analysis components such as signal interpretation and background 
subtraction). Optionally software may also be provided reader to control a graphic user interface and 
one or more systems for sorting, categorizing, storing, analyzing, or otherwise processing the data 
output of the reader. 

To "read" an array according to this invention, an array that has been assayed with a 
detectable probe to produce binding (e.g., a binding pattern) can be placed into (or onto, or below, 
etc., depending on the location of the detector system) the reader and a detectable signal indicative of 
probe binding detected by the reader. Those addresses at which the probe has bound to immobilized 
polypeptide sample provide a detectable signal, e.g., in the form of electromagnetic radiation. These 
detectable signals could be associated with an address identifier signal, identifying the site of the 
complex. The reader gathers information from each of the addresses, associates it with the address 



WO 00/54046 



PCTAJSOO/06244 



-23- 

identifier signal, and recognizes addresses with a detectable signal as distinct from those not 
producing such a signal. The reader is also capable of detecting intermediate levels of signal, 
between no signal at all and a high signal, such that quantification of signals at individual addresses 
is enabled. 

Certain readers that can be used to collect data from the arrays of this invention, especially 
those that have been probed using a fluorescently tagged molecule, will include a light source for 
optical radiation emission. The wavelength of the excitation light will usually be in the UV or visible 
range, but in some situations may be extended into the infra-red range. A beam splitter can direct the 
reader-emitted excitation beam into the object lens, which for instance may be mounted such that it 
can move in the x, y and z directions in relation to the surface of the array substrate. The objective 
lens focuses the excitation light onto the array, and more particularly onto the (polypeptide) targets 
on the array. Light at longer wavelengths than the excitation light is emitted from addresses on the 
array that contain fluorescently-labeled probe molecules (i.e., those addresses containing a 
polypeptide to which the probe binds). 

In certain embodiments of the invention, the array may be movably disposed within the 
reader as it is being read, such that the array itself moves (for instance, rotates) while the reader 
detects information from each address. Alternatively, the array may be stationary within the reader 
while the reader detection system moves across or above or around the array to detect information 
from the addresses of the array. Specific movable-format array readers are known and described, for 
instance in U.S. Patent No. 5, 922,617, hereby incorporated in its entirety by reference. Examples of 
methods for generating optical data storage focusing and tracking signals are also known (see, for 
example, U.S. Pat. No. 5,461,599, hereby incorporated in its entirety by reference). 

For the electronics and computer control, a detector (e.g., a photomultiplier tube, avalanche 
detector, Si diode, or other detector having a high quantum efficiency and low noise) converts the 
optical radiation into an electronic signal. An op-amp first amplifies the detected signal and then an 
analog-to-digital converter digitizes the signal into binary numbers, which are then collected by a 
computer. 



111. Examples 

Example 1: Preparation of a UPA 

Methods and Materials 

To identify target proteins to which the transcriptional coactivator p52 will bind, the proteii 
array system for which a target arrangement key is shown in Table 1 was provided. The general 
transcription factors, activators and coactivators arrayed were overexpressed either in bacteria, 
baculovirus or in mammalian ceils and purified to near homogeneity as previously described (Chian 
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etai.EMBOJ., 12, 2749-2762, 1993; Kershnar e/a/., J Biol. Chem., 18, 34444-34453, 1998; Luo 
et a/., Cell, 71, 231-241, 1992; Jackson and Tjian, Proc. Natl Acad Set. USA, 86, 1781-1785, 1989; 
Ge et ai t Methods EnzymoL, 274, 57-71, 1996). The serine-arginine (SR) protein fraction was 
prepared from HeLa cell nuclear extracts essentially according to Zahler et at. (Genes Dev., 6, 837- 
5 847, 1992). GST-nucleolin fusion protein (GST-Nu, address I2e/f) was prepared by overexpressing 
plasmid GST-HNB (provided by Dr M. Srivastava), which contains nucleolin coding sequence 
positions 290-707, in bacteria and purified on a glutathione-Sepharose column. Glutathione S- 
transferase fused to a HMK site (RRASV) (GST-K) (Ge et ai y Mol Cell, 2, 75 1-759, 1998) was used 
as a negative control in the experiments. 

10 An average of 7.5 pmol (normalized by Bio-Rad protein assay, Bio-Rad, Hercules, CA) of 

each of the 48 highly purified proteins (or fractions) was spotted on a 12 x 8 cm nitrocellulose 
membrane using a 96-well dot blot apparatus (Bio-Rad, Hercules, CA). This apparatus provides 
sample application to a membrane to form an array arranged in twelve rows and eight columns. The 
arrangement of the polypeptide targets in the array is shown in Table 1, which corresponds to the ' 

15 array results shown in Figures 1 and 2. Each sample was duplicated in two adjacent wells to provide 
a useful internal control. 

Each sample was diluted to 100 ul with buffer A 100 ( 100 mM KCI, 1 0% glycerol, 20 mM 
HEPES Na pH 7.9, 0.2 mM EDTA, 1 0 mM 2-mercaptoethanol and 0.5 mM PMSF) and duplicated in 
two adjacent wells. Each well was rinsed with 2 x 500 p.1 buffer A 100 and the vacuum kept for 3-5 

20 minutes. After removal from the dot blot apparatus, the protein array was rinsed with two changes of 
buffer A 100. 
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Table 1* 





a o 


c 


d 


e 


f 


g h 


1 


(i) 

TFHA 1 TFIIA 
1 r 11A | 1 rllA 


(2) 

TFIIB | TFIIB 


(3) 

TBP | TBP 


(4) 

f:TFIID | f:TFIID 


2 


(5) 

true | Irllh 


(6) 

TFIIF | TFIIF 


(7) 

f:TFIIH | f:TFIIH 


(8) 

Pol II | Pol II 


3 


(9) 

DVD 1 DVD 

KAK | KAK 


(10) 
TR | TR 


(11) 

Oct 1 | Oct 1 


(12) 
Spl | Spl 


4 


(13) 

u**-y*i | U4-y4 


(14) 

G4-147 | G4-I47 


(15) 

G4-AH | G4-AH 


(16) 

G4-VP16 | G4-VP16 


5 


(17) 

C\A PTC 1 /"M rTC 

vjh-L J r | vj4-Clr 


(18) 

G4-Spl | G4-Spl 


(19) 

G4-EIA | G4-E1A 


(20) 

G4-IE | G4-IE 


6 


(21) 

G4-Tat | G4-Tat 


(22) 

PC4-P | PC4-P 


(23) 

PC4-N | PC4-N 


(24) 

PC4-C | PC4-C 


7 


(25) 

PC4-AS | PC4-AS 


(26) 

PC4-mI | PC4-ml 


(27) 

PC4-m2 | PC4-m2 


(28) 

PC4-m3 | PC4-m3 


8 


(29) 

PC4-m4 | PC4-m4 


(30) 

PC4-m5 | PC4-m5 


(31) 

PC4-m6 | PC4-m6 


(32) 

PC4-m7 | PC4-m7 


9 


(33) 

PC4-wt | PC4-wt 


(34) 
p52 | p52 


(35) 
p75 | p75 


(36) 

p75-C | p75-C 


10 


(37) 

p300-C | p300-C 


(38) 

PCAF | PCAF 


(39) 

PCAF-C | PCAF-C 


(40) 

TAF250 1 TAF250 


I] 


(4 

Topo 1 
(wt) 


1) 

Topo I 
(wt) 


(4 

Topo I 
(mt) 


2) 

Topo 1 
(mt) 


(4 

Topo I 
(wt)» 


3) 

Topo I 
(wt)» 


(4 

Topo 1 
(nati) 


4) 

Topo 1 
(nati) 


12 

1 A LL 


(4 

ASF 


5) 

ASF 


(4 

SR 
(+nucl) 


6) 

SR 
(+nucl) 


(4 

GST-Nu 


7) 

GST-Nu 


(4 

GST-K 


8) 

GST-K 



1 Abbreviations used in Table 1 are explained above, in the Abbreviations section (IA). 



Each sample was duplicated in two adjacent wells. The actual size of the membrane is 12 X 
8 cm (height X width) with eight columns and twelve rows. 



Example 2: Removal of probe molecules from 
the UPA. 

Methods and Materials 

The same universal protein array that was prepared in Example I was reused with a protein 
probe (Example 3), a dsDNA probe (Example 4), a ssDNA probe (Example 4), a RNA probe 
(Example 5) and a ligand probe (Example 6). After each use, the filter was stripped with buffer A 
containing 1 M (NH 4 ) 2 S0 4 and 1 M urea at room temperature for 30-60 minutes. Then the stripped 
array was equilibrated with buffer A 1 00 before being incubated with another probe. 
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Exampie 3: Interaction with a protein probe. 

Methods and Materials 

Purified GST-K-p52 protein (Ge et aL, MoL Cell, 2, 751-759, 1998) was labeled by heart 
muscle kinase (HMK) in a 50 ul reaction containing 10 ug of substrate protein, 40 uCi [y- 32 P]ATP 
and 10 U of the catalytic subunit of Ca-independent protein kinase A from bovine heart (Sigma, St. 
Louis, MO) at 30°C for 30 minutes. The 32 P-labeled protein was purified through glutathione- 
Sepharose beads to separate uncoupled free nucleotide. In the case of the ASF/SF2 probe, pETl la- 
6H(K)-ASF/SF2 was created by inserting the ASF/SF2 coding region into the vector pETIla-6H(K) 
(Ge et al, MoL Cell, 2, 75 1 -759, 1 998) and overexpressed in Escherichia coli cells. Recombinant 
protein was affinity purified and labeled by HMK in vitro as described above. Pre-treatment took 
place in buffer A 100 containing 1% non-fat milk at room temperature for at least 30 minutes. The 
array was then incubated with 30-50 ng probe/ml buffer A 100 (+1% milk) at 4°C for over 12 hours. 
After incubation, the array was sequentially washed with three changes of buffer A 100 (100 mM 
KCI), A500 (500 mM KC1) and A 1000 (1000 mM KC1). The resulting signals were visualized by 
autoradiography (exposure from 30 minutes to 10 hours) and quantified with a densitometer 
(Molecular Dynamics, Sunnyvale, CA). 

Results 

GST-K-p52 was labeled in vitro with [y- 32 P]ATP by HMK. (Ge et al. t MoL Cell, 2, 751-759, 
1998) and further purified through glutathione-Sepharose beads. The protein array was first treated 
with buffer A 100 containing 1% non-fat milk and then incubated with 32 P-labeled GST-K-p52 as 
described in Materials and Methods. The filter was extensively washed with buffer A containing 
100, 500 and 1000 mM KCI prior to each autoradiographic analysis. A low salt wash (with 100 mM 
KCI) allowed the detection of most possible interactions (Fig. 1A), while a high salt (with 500-1000 
mM KCI) allowed the detection of highly specific and high affinity interactions (Fig. IB). No 
significant difference was found between the 500 and 1000 mM salt washes. The relative affinity of 
each tested protein for the probe could be measured with either a densitometer or a phosphorimager 
(Fig. 1C). Among all 48 proteins (or fractions), the SR protein fraction (addresses 12c/d) and the 
recombinant GST-nucleolin (addresses 12e/Q had the highest affinities for the transcriptional 
coactivator p52. 

It has previously been shown that, in addition to the ability to interact specifically with a 34 
kDa doublet corresponding to the splicing factor ASF/SF2, p52 could also interact strongly with a 
100 kDa protein found to be present in the SR fraction by far- western blot analysis (Ge et aL y MoL 
Cell, 2, 751-759, 1998). Protein microsequence analysis indicated that the 100 kDa band isolated 
from the SR protein fraction contained two proteins, nucleolin and QNA topoisomerase I (topo I). In 
the present experiment, p52 strongly interacted with the recombinant GST-nucleolin but not with 
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topo I, either recombinant proteins expressed in baculovirus (Fig. 1, addresses 1 la-f) or naturally 
purified protein from mammalian cells (addresses I lg/h). This observation demonstrates that p52 
interacts with the nucleolin rather than the topo I present in the SR protein fraction, which is 
consistent with the recent observation that nucleolin is a component of the multiprotein complex 
associated with p52 in HeLa cells. 

Nucleolin has been implicated in regulating pre-rRNA processing (Bouvet et al, EMBO 1, 
16, 5235-5246, 1997), pre-mRNA splicing (Ishikawa et al, Mol Cell Biol, 13, 4301-4310, 1993), B 
cell-specific transcription (Hanakahi et al., Proa Natl Acad. Scl USA, 94, 3605-3610, 1997), 
unwinding DNA, RNA or DNA-RNA duplexes (Tuteja et al, Gene, 28, 143-148, 1995) and 
mediating ceil doubling time in human cancer cells (Derenzini et al. Lab. Invest, 73, 497-502, 
1995). Like the splicing factor ASF/SF2, nucleolin also contains RNP type RNA-binding domains as 
well as RGG repeats (Bouvet etal, EMBO J., 16, 5235-5246, 1997; Valdez et al, Mol Immunol, 
32, 1207-1213, 1995). Its activity can be modulated through mitosis-specific phosphorylation by 
p34cdc2 kinase or casein kinase II (Tuteja et al, Gene, 28, 143-148, 1995). Therefore, it would be 
interesting to further examine the biological significance of nucleolin interaction with the general 
transcriptional coactivator and splicing regulator p52. 

In addition to GST-K-p52, other protein probes have also been tested in the UPA system. 
Figure 3 shows the binding activity of 32 P-Iabeied splicing factor ASF/SF2, a member of the SR 
protein family, to 16 different selected proteins in a 4 by 4 array (Fig. 3A). ASF/SF2 significantly 
bound to five of the 16 proteins, including the affinity-purified TFIID complex (Fig. 3Band C, 
address 2d), retinoid-X receptor (address 3a), histone HI (address 3c), co-histones (address 3d) and 
ASF/SF2 itself (address 4b). However, after washing the UPA with 500 mM KCI, ASF/SF2 
appeared to have the highest affinity for itself (Fig. 3C, address 4b), which is in agreement with the 
previous observation that in vitro translated ASF/SF2 could strongly bind to GST-ASF/ SF2 in a GST 
pull down assay (Xiao and Manley, EMBO J., 17, 6359-6367, 1998). However, ASF/SF2 also 
showed high affinity for the TFIID complex. Since ASF/SF2 did not interact with TBP (address 2c), 
ASF/SF2 might interact directly with TBP-associated factors. Whether such an interaction reflects 
the function of TFIID or ASF/SF2 in transcription or pre-mRNA splicing or coupling of these could 
also to be investigated using the disclosed UPA technology. Taken together, these experiments 
demonstrate that UPA can be used to detect protein interactions with various targets. 

Using the same UPA, it is shown that PC4 with a single point mutation (Phe— Pro) at 
position 77 lost both dsDNA- and ssDNA-binding activity (Fig. 2A and B, addresses 8c/d), but still 
retained RNA-binding activity (Fig. 2C, addresses 8c/d). In contrast, phosphorylation of PC4 by 
casein kinase II stimulated the DNA-binding activity (Fig. 2A and B, addresses 6c/d), but reduced its 
RNA-binding activity (Fig. 2C, addresses 6c/d). These observations demonstrate that UPA is an 
effective method to map protein interaction domains and DNA- or RNA-binding domains of a 
protein. 



WO 00/54046 



PCTAJSOO/06244 



-28- 

Example 4: Interaction with a DNA probe. 
Methods and Materials 

5 To test whether the UPA system could also be used to detect interactions with other (e.g., 

biological) molecules, the same array was stripped (see Example 2) and reprobed with a 32 P-labeled 
double-stranded oligonucleotide (64 bp) containing the adenovirus major late core promoter 
elements. 

A double-stranded (ds) oligonucleotide (64 bp with plus strand 5*-GGGGGGCTATAAAA- 
10 GGGGGTGGGGGCGCGTTCGTCCTCACTCTCTTCCGCATCGCTGTCTGCG and minus strand 
S'-CCCTCGCAGACAGCGATGCGGAAGAGAGTGAGGACGAACGCGCCCCCACCCCCTTTT- 
ATAGCCC) corresponding to the adenovirus major late promoter region from -39 to +29 was 
labeled at the 3'-end of the minus strand with Klenow fragment in the presence of [ 32 P]dCTP. After 
labeling, the free nucleotides were separated from the probe by passing the labeling reaction through 
15 a G-50 nick column (Pharmacia Biotech, United Kingdom). Pre-treatment took place with buffer A 
containing 60 mM KCI, 2x Denhardfs solution and 25 ug/ml poly(dG dC) (Sigma, St. Louis, MO) at 
room temperature for 30 minutes. For interaction, 5 ng/ml of J2 P-labeled double-stranded (ds)DNA 
was added to the same buffer and incubation was carried out at 4° C for > 12 hours. The array was 
then sequentially washed with three changes of buffer A 100, A500 and A 1000 followed by 
20 autoradiography and quantification. 

To analyze the array with a single-stranded (ss)DNA probe, the 64-mer minus strand of the 
dsDNA probe was labeled at the 5'-end by T4 polynucleotide kinase in the presence of y-[ 32 P]ATP. 
Other conditions were exactly the same as those for the dsDNA probe. 

25 Results 

The results shown in Figure 2A indicate that, after washing with 500 mM salt, 
phosphorylated PC4 (PC4-P, addresses 6c/d), an inactive form of a previously described 
transcriptional coactivator (Ge et al. y Proc. Natl. Acad Set. USA, 91, 12691-12695, 1994), purified 
from HeLa cells had the highest affinity for the tested dsDNA probe among 48 samples (see 

30 quantification in Table 2). PC4-P had 3- to 5-fold higher affinity for dsDNA compared to other PC4 
derivatives, including wild-type PC4 (addresses 9a/b). In contrast, a single amino acid change at 
position 77 (Phe— *Pro) completely abolished the dsDNA binding ability of PC4 (addresses 8c/d). 
These results are in agreement with the observations reported recently using gel mobility shift assays 
that phosphorylated PC4 bound bubble DNA with higher affinity and the region around position 77 

35 was critical for the DNA-binding activity of PC4 (Werten, etai, EMBOJ., 17, 5103-5 111, 1998). 
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Although it is known that TBP can specifically bind the present probe, the signal is 
relatively weak compared to other DNA-binding proteins. This result is consistent with the 
observation from gel mobility shift assays that the binding activity of TBP to TATA box-containing 
DNA was barely detectable. However, it can be significantly enhanced by the presence of another 
transcription factor, TFIIA (Orphanides et ai, Genes Dev., 10, 2657-2683, 1996). On the other hand, 
however, many other general (non-sequence-specific) DNA-binding proteins had much stronger 
signals than TBP, suggesting that the present system may not be suitable for determining the binding 
activity of sequence-specific DNA-binding (and/or RNA-binding) proteins. ASF/SF2 was identified 
as an RNA-binding protein playing an essential role(s) in pre-mRNA splicing. Both the recombinant 
ASF/SF2 (addresses 12a/b) and the native ASF/SF2-containing SR protein fraction (12c/d) bound 
dsDNA as well as ssDNA (see below) very strongly, even tighterthan most of the DNA-binding 
proteins tested (see quantification in Table 2), indicating that ASF/SF2 is also a DNA-binding 
protein. After the array was analyzed with a ssDNA probe (Fig. 2B), although several differences 
were observed, the overall partem of protein-ssDNA interactions was similar to that of protein- 
dsDNA interactions, suggesting that most DNA-binding proteins are capable of binding both dsDNA 
and ssDNA. 

Example 5: Interaction with a RNA probe. 

Methods and Materials 

An SV40 early pre-mRNA was synthesized in vitro from the plasmid pSVi66 by SP6 RNA 
polymerase as previously described (Ge et aL y MoL Cell, 2, 751-759, 1998). Interaction was carried 
out at 4° C for >12 hours in the presence of 20 mM HEPES Na pH 7.9, 5% glycerol, 10 mM 2- 
mercaptoethanol, 0.2 mM EDTA Na pH 8.0, 60 mM KCI, 2 mM MgCJ 2 , 0.5 mg/ml BSA, 25 ug/ml 
tRNA and -5 ng/ml 32 P-Iabeled SV40 early pre-mRNA. The array was then sequentially washed and 
visualized by autoradiography as described for the DNA probes. 

Results 

This protein array system was also used successfully to analyze interactions with an RNA 
probe transcribed from the SV40 early region-containing plasmid pSVi66 (Ge et al, MoL Cell, 2, 
751-759, 1998). Several interesting observations were revealed (see Fig. 2C). First, phosphorylation 
by casein kinase II in vivo apparently decreased the affinity of PC4 for the RNA probe (addresses 
6c/d in Fig. 2C; see also Table 2), although it increased the affinity of PC4 for both the dsDNA and 
ssDNA probes (addresses 6c/d in Fig. 2A and B). Second, in contrast to the DNA-binding activity, 
the RNA-binding activity of PC4 was not significantly affected by the mutation at position 77 
(addresses 8c/d in Fig. 2C). Third, both p52 and p75 strongly bind the RNA probe (addresses 9c-h in 
Fig. 2C), but did not significantly bind either the dsDNA or ssDNA probe in this assay (addresses 9c- 
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h in Fig2A and B). Finally, PCAF, a p300/CBP-associated factor that functions as a histone 
(Ogryzko et aL, Cell, 87, 953-959, 1996), could bind the RNA probe very strongly (addresses lOc-f 
in Fig. 2C; see Table 2 for quantification), suggesting a possible role of PCAF in RNA metabolism. 



5 Example 6: Interaction with a ligand probe 

Methods and Materials 

L-3,5,3'-[ 12S I]Triiodothyronine (T3) was purchased from NEN (Boston, MA, catalog no. 
NEX1 1 OH). The interaction conditions were essentially the same as for the RNA probe except that 
10 tRNA was omitted and 0.3 uCi/ml [ l25 I]T3 was added instead of the RNA probe. 

Results 

This protein array system was also used successfully to analyze interactions with a 125 I- 
labeled ligand, T3. Only the recombinant thyroid hormone receptor bound 12S l-labeIed T3 strongly 
1 5 and specifically (addresses 3c/d in Fig. 2D). 



Table 2 



n 


Position 


Protein/source 


p52" 


ds 
DNA " 


ss 
DNA 1 


RNA" 


Function/Description 


i 


la/b 


TFI I A/bacteria a 


12.3 


0.9 


15.1 


0.3 


class 11 gene transcription 


2 


Ic/d 


TrilB/bacteria* 


0 


0 


0.2 


0 


class 11 gene transcription 


3 


lc/F 


TBP/bactcria a 


5.1 


0.5 


5.9 


35.8 


class II gene transcription 


4 


Ig/h 


f:TFI!D/HcLa b 


8 


3.3 


1.9 


12.4 


class II gene transcription 


5 


2a/b 


TFIIE/bacteria* 


0 


0 


0.2 


0.6 


class II gene transcription 


6 


2c/d 


TFIlF/bacteria a 


0.7 


0 


0,3 


1.4 


class II gene transcription 


7 


2e/f 


f:TFHH/HcLa e 


1.4 


2.5 


0.4 


4.7 


class II gene transcription 


8 


2g/h 


RNA pol II/HeLa ■ 


9 


22.4 


10.8 


6.1 


class 11 gene transcription 


9 


3a/b 


RXR/bacteria 


30 


11.2 


37.9 


34.6 


activator (retinoid-X receptor) 


10 


3c/d 


TR/bacteria 


16.3 


29.3 


27.5 


52.4 


activator (thyroid hormone receptor) 


1 1 


3e/f 


Octl/HeLa" 


10.5 


3.5 


4.7 


9 


B cell specific activator 


12 


3g/h 


Spl/HeLa< 


3.2 


2.6 


2.2 


1.3 


class 11 gene activator 


13 


4a/b 


G4-94/bacteria 


0.5 


0.2 


1 


40.2 


activator (DNA binding domain) 


14 


4c/d 


G4-I47/bacteria 


3 


0.1 


0.1 


8.7 


activator (DNA binding domain) 


15 


4e/f 


G4-AH/bactcria a 


1.9 


0.5 


0.3 


3.8 


class II gene activator 


16 


4g/h 


G4-VPl6/bacteria 


1.5 


1.3 


0 


0 


class 11 gene activator 


17 


5a/b 


G4~CTF/bacteria 


0.9 


0 


0.1 


8.2 


class II gene activator 


18 


5c/d 


G4-Spi/bacteria 


4.5 


16.6 


9.3 


76.7 


class II gene activator 


19 


5e/f 


G4-E1 A/bacteria 


1.8 


1.2 


0 


8 


class II gene activator 


20 


5g/h 


G4-IE/bacteria 


4 


0.8 


0 


0.9 


class II gene activator 


21 


6a/b 


G4-Tai/bacteria 


2.3 


1.6 


3.4 


15.7 


class II gene activator 


22 


6c/d 


PC4-P/HeLa ' 


5.4 


100 


77.5 


14.5 


coactivator (phosphorylated) 


23 


6e/f 


PC4-N/bacteria 


16.8 


2.3 


0.8 


10.6 


PC4 (C-terminal deletion) 


24 


6g/h 


PC4-C/bacieria 


30.3 


1.6 


0.1 


37.5 


PC4 (N-terminal deletion) 


25 


7a/b 


PC4-AS/bactcria 


3.9 


35 


41.6 


87 


PC4 (CKII sites mutated) 


26 


7c/d 


PC4-ml /bacteria 


3.9 


23.2 


44.4 


75.1 


PC4 K231/K29A) 


27 


7e/f 


PC4-m2/bacteria 


10.4 


23.6 


48.3 


71.9 


PC4 (K35I/K4IA) 
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28 


7g/h 


PC4-m3/bacteria 


5.4 


21.7 


41.2 


61.2 


PC4 fR27A/K"3RI/K9QA\ 


29 


8a/b 


PC4-m4/bacteria 


3.8 


31.5 


45 


100 


PC4 fR47N/K5^l/n^QA\ 


30 


8c/d 


PC4-m5/bacteria 


2.2 


1.4 


4.5 


89.5 


PC4 fF77P\ 


31 


8e/f 


PC4-m6/bactcria 


2.8 


37 


66.7 


87.3 


PC4 fK?QA* 


32 


8g/h 


PC4-m7/bacteria 


0.6 


27.4 


56.9 


75.4 


PC4 fK4 1 


33 


9a/b 


PC4-wt/bacteria 


2.8 


37 


66.7 


72.7 


transcriptional coactivator (wild type) 


34 


9c/d 


p52/bacteria * 


2.1 


0.8 


o 




transcriptional coactivator 


35 


9e/f 


p75/bacteria * 


5 


1.4 


o 


S4 f% 


transcriptional coactivator 


36 


9g/h 


p75-C/bacteria 1 






O 7 


oo.z 


coactivator (C-terminal 326-530) 


37 


lOa/b 


D300-C/hacuIoviru<; k 




1 1 ft 

1 I.U 


1 1 .J 


14 S 
IH.J 


transcriptionai coactivator (1 135-2414) 


38 


lOc/d 


PCAF/baculovirus h 


X.J 


2.4 




70.J 


histone acetyltransferse 


39 


lOe/f 


PCAF-C/baculovirus h 


5.3 


8.4 


8 


74.3 


PCAF (352-832) 


40 


lOg/h 


TAF250/baculovirus i 


1.7 


t.l 


0.6 


10.9 


transcriptional coactivator 


41 


lla/b 


Topo I/baculovirus' 


1.1 


1 


0.9 


12.4 


DNA unwinding/transcription 


42 


llc/d 


Topo I/baculovirus j 


4.7 


2.4 


2.1 


12.6 


Topo I (Y723F) 


43 


Mc/f 


Topo I/baculovirus k 


5 


1.6 


l.t 


6 


Topo 1 (wild type) 


44 


llg/h 


Topo I/HeLa 


2.3 


1.7 


0.7 


76.5 


native Topo 1 0 


45 


12a/b 


ASF/SF2/bacteria ' 


13.5 


33.5 


89 


40.4 


splicing factor (SR. protein) 


46 


12c/d 


SR/HeLa 1 


55 


80 


100 


9.7 


splicing factors (SR family) 


47 


12c/f 


GST-Nu/bacteria m 


100 


0.4 


3.3 


8.3 


pre-rRNA processing factor (nucleolin) 


48 


12g/h 


GST-K/bacteria 


2.8 


0.4 


1.5 


1.8 


negative control 



a Ge et at. Methods Enzymol., 274, 57-71, 1996 

b Chiang e/ a/., £A-/flCU, 12,2749-2762. 1993 

c Kcrshnare/a/.. J. Biol. Chem., 18, 34444-34453, 1998 

a Uoe/al..Celt 71, 231-241, 1992 

e Jackson and Tjian f Proc. Nati Acad. Sci. USA, 86. 1781-1785, 1989 

' Ge etal. . Proc. Natl. Acad. Sci. USA. 9 1 , 1 269 1 - 1 2695. 1 994 

«Ge etal.. Mot Ceil. 2, 751-759, 1998 

h Ogryzko eiaL Celt 87,953-959, 1996 

i Mizzcne/fl/..Ce// ( 87, 1261-1270. 1996 

> Wang and Roedcr. Mol. Cell, 1. 749-757. 1998 

k Pourquicr et al , J. Biol. Chem., 272. 2644 1 -26447. 1997 

1 Zahler et at. Genes Dev., 6, 837-847. 1992 

Valdez et ai. Mot Immunol, 32, 1207-12 13, 1995 
n Relative binding affinity of the specified probe to each target on the array, normalized to the highest signal for each probe 
0 TopoGEN inc.. Columbus, OH 



The number, position (address), name/source (and related reference), affinities for each probe 
and known function for each of the 48 target polypeptides are indicated. The highest affinities of the 
individualized proteins for each probe molecule [GST-nucleolin for p52 (addresses 12e/f)> PC4-P for 
the dsDNA (addresses 6c/d), SR for the ssDNA (addresses 12c/d) and PC4-m4 for the RNA 
(addresses 8a/b)] where normalized to 100 and are indicated in bold. 



Example 7: Kits 



UPAs as disclosed herein can be supplied in the form of a kit for use in molecule binding 
analyses. In such a kit, at lest one polypeptide array is provided. The kit will also include 
instructions, usually written instructions, to assist the user in probing the array. Such instructions can 
optionally be provided on a computer readable medium. 

Kits may additionally include one or more buffers for use during assay of the provided 
array. For instance, such buffers may include a low stringency, a high stringency wash, and/or a 
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stripping solution. These buffers may be provided in bulk, where each container of buffer is large 
enough to hold sufficient buffer for several probing or washing or stripping procedures. 
Alternatively, the buffers can be provided in pre-measured aliquots, which would be tailored to the 
size and style of array included in the kit. 

Certain kits may also provide one or more containers in which to carry out array-probing 
reactions. 

Kits may in addition include either labeled or unlabeled control probe molecules, to provide 
for internal tests of either the labeling procedure or probing of the UPA, or both. The control probe 
molecules may be provided suspended in an aqueous solution or as a freeze-dried or lyophilized 
powder, for instance. The container(s) in which the controls are supplied can be any conventional 
container that is capable of holding the supplied form, for instance, micro fuge tubes, ampoules, or 
bottles. In some applications, control probes may be provided in pre-measured single use amounts in 
individual, typically disposable, tubes or equivalent containers. 

The amount of each control probe supplied in the kit can be any particular amount, 
depending for instance on the market to which the product is directed. For instance, if the kit is 
adapted for research or clinical use, sufficient control probe(s) likely will be provided to perform 
several controlled analyses of the array. Likewise, where multiple control probes are provided in one 
kit, the specific probes provided will be tailored to the market. In certain embodiments, a plurality of 
different control probes will be provided in a single kit each control probe being from a different 
class of molecules (e.g., a nucleic acid probe, a protein probe, a ligand probe, etc.). 

In some embodiments of the current invention, kits may also include the reagents necessary 
to carry out one or more probe-labeling reactions. The specific reagents included will be chosen in 
order to satisfy the end user's needs, depending on the type of probe molecule {e.g., nucleic acid, 
polypeptide, or ligand) and the method of labeling {e.g., radiolabel incorporated during probe 
synthesis, attachable fluorescent tag, etc.). 

Further kits are provided for the labeling of probe molecules for use in assaying arrays 
provided herein. Such kits may optionally include an array to be assayed by the so labeled probe 
molecules. Other components of the kit are largely as described above for kits for the assaying of 
UPAs. 

In view of the many possible embodiments to which the principles of our invention may be 
applied, it should be recognized that the illustrated embodiments are only a certain examples of the 
invention and should not be taken as a limitation on the scope of the invention. Rather, the scope of 
the invention is defined by the following claims. We therefore claim as our invention ail that comes 
within the scope and spirit of these claims. 
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We claim: 



1 . A protein interaction assay comprising: 

contacting an array of substantially pure target polypeptide molecules stably 
associated with a surface of a solid support with a detectable probe molecule under conditions 
sufficient to produce binding; and 

detecting the binding. 

2. The assay of claim 1, further comprising removing unbound probe molecule prior 
to detecting the binding. 

3. The assay of claim 1, wherein the detectable probe molecule comprises a single- 
stranded nucleic acid, a double-stranded nucleic acid, a protein, or a ligand. 

4. The assay of claim 1, wherein the detectable probe molecule comprises a tag useful 
for detection. 

5. The assay of claim 4, wherein the tag is fluorescent, luminescent, or immunogenic. 

6. The assay of claim 1, wherein the array comprises a microarray. 

7. The assay of claim 1, wherein the polypeptides are associated with the support at 
discrete addresses. 

8. The assay of claim 7, wherein each address contains only one substantially pure 
target polypeptide. 

9. The assay of claim 1, wherein the binding detected is a binding pattern. 

10. An assay to determine polypeptide-binding of a probe molecule, comprising: 

(a) preparing a labeled sample of the probe molecule; 

(b) contacting the labeled sample with an array of substantially pure target 
polypeptides stably associated with the surface of a solid support under conditions sufficient to 
produce binding; 

(c) separating unbound labeled probe from the array to produce a probed array; and 

(d) detecting the binding. 

1 1 . The assay of claim 10, further comprising contacting the probed array with at least 
one additional member of a signal producing system. 

12. The assay of claim 11, wherein the contacting the probed array with at least one 
additional member of a signal producing system is prior to detecting the binding. 

13. A universal protein array, comprising 

a plurality of substantially pure target polypeptide samples provided on a solid 

support, 

wherein the samples are immobilized on the solid support in an addressable pattern. 

14. The array of claim 13, wherein each address contains only one substantially pure 
target polypeptide. 
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15. The array of claim 13, wherein the addresses are arranged in rows and columns. 

16. The array of claim 13, wherein the array is arranged in a computer readable format. 

17. The array of claim 13, comprising at least 10 different polypeptide samples. 

18. The array of claim 13, comprising at least 30 different polypeptide samples. 

19. The array of claim 13, comprising at least 100 different polypeptide samples. 

20. The array of claim 13, wherein the array comprises a microarray. 

21. The array of claim 13, wherein the solid support comprises glass, nitrocellulose, 
polyvinylidene fluoride, nylon, fiber, or combinations thereof. 

22. The array of claim 13, wherein the polypeptides comprises transcriptional factors, 
transcriptional activators, or transcriptional coactivators. 

23. The array of claim 22, wherein the polypeptides comprise TFIIA, TFIIB, TBP, 
fiTFIID. TFI1E, TFI1F, f:TFHH, Pol II, RXR, TR, Oct 1, Spl, G4-94, G4-147, G4-AH, G4-VP16, 
G4-CTF, G4-Spl, G4-E1A, G4-1E, G4-Tat, PC4-P, PC4-N, PC4-C, PC4-AS, PC4-ml, PC4-m2, 
PC4-m3, PC4-m4, PC4-m5, PC4-m6, PC4-m7, PC4-wt, p52, p75, p75-C, p300-C, PCAF, PCAF-C, 
TAF250, Topo I (wt), Topo I (mt), Topo I (wt)\ Topo 1 (nati), ASF, SR, GST-Nu, or GST-K. 

24. A kit for determining polypeptide-bindingof a probe molecule, comprising 
a polypeptide array; and 

instructions. 

25. The kit of claim 24, wherein the instructions include directions for exposing the 
probe molecule to an array of substantially pure polypeptides on a support under conditions in which 
the probe molecule is capable of binding to one or more of the polypeptides of the support to detect 
biological interactions between the probe molecule and the one or more polypeptides. 

26. The kit of claim 24, wherein the polypeptide array comprises a microarray. 

27. The kit of claim 24, further comprising a buffer. 

28. The kit of claim 24, wherein the polypeptide an*ay comprises a plurality of 
substantially pure polypeptide samples. 

29. The kit of claim 24, further comprising a probe molecule standard. 

30. The kit of claim 29, wherein the probe molecule standard comprises a label. 

3 1 . The kit of claim 28, wherein the substantially pure polypeptides comprise 
transcriptional factors, transcriptional activators, or transcriptional coactivators. 

32. The kit of claim 31, wherein the polypeptides comprise TFIIA, TFIIB, TBP, 
f:TFIID, TFIIE, TFIIF, f:TFIIH, Pol II, RXR, TR, Oct 1, Spl, G4-94, G4-147, G4-AH, G4-VP16, 
G4-CTF, G4-Spl, G4-E1A, G4-IE, G4-Tat, PC4-P, PC4-N, PC4-C, PC4-AS, PC4-ml, PC4-m2, 
PC4-m3, PC4-m4, PC4-m5, PC4-m6, PC4-m7, PC4-wt, p52, p75, p75-C, p300-C, PCAF, PCAF-C, 
TAF250, Topo I (wt), Topo I (mt), Topo I (wt)*, Topo I (nati), ASF, SR, GST-Nu, or GST-K. 

33. A method of analysis of protein-molecule interactions, comprising: 
obtaining a plurality of different substantially pure protein specimens; 
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placing a sample of each specimen in a discrete addressable location on a recipient 

array; and 

probing the array with a detectable probe molecule, 

34. The method of claim 33, wherein the array comprises a microarray. 

35. The method of claim 33, wherein the probe molecule comprises a nucleic acid, a 
polypeptide, a ligand, a fragment thereof, or mixtures thereof. 

36. A method of analyzing a plurality of binding characteristics of an array of 
polypeptide samples, comprising: 

(a) providing a protein array comprising a plurality of different polypeptide 

samples; 

(b) exposing the protein array to a first probe that may interact with the samples of 
the universal protein array to identify those samples to which the first probe binds; 

(c) detecting a first binding pattern of the first probe; 

(d) repeating (b) through (c) with a second probe to identify samples to which the 
second probe binds. 

37. The method of claim 36, further comprising stripping bound first probe from the 
array prior to exposing the array to the second probe. 

38. The method of claim 36, wherein the protein array comprises 

a plurality of substantially pure target polypeptide samples; and 
a solid support, 

wherein the samples are immobilized on the solid support in an addressable pattern. 

39. The method of claim 36, wherein the first probe and the second probe are selected 
from different classes of molecules. 

40. The method of claim 36, wherein the protein array comprises a microarray. 

41. The assay of claims 1 or 10, wherein detection is automated. 

42. The method of claim 36, wherein detection is automated. 
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